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Abstract 

Non-relativistic quantum theory is derived from information codified 
into an appropriate statistical model. The basic assumption is that there 
is an irreducible uncertainty in the location of particles: positions consti- 
tute a configuration space and the corresponding probability distributions 
constitute a statistical manifold. The dynamics follows from a principle 
of inference, the method of Maximum Entropy. The concept of time is 
introduced as a convenient way to keep track of change. A welcome fea- 
ture is that the entropic dynamics notion of time incorporates a natural 
distinction between past and future. The statistical manifold is assumed 
to be a dynamical entity: its curved and evolving geometry determines 
the evolution of the particles which, in their turn, react back and deter- 
mine the evolution of the geometry. Imposing that the dynamics conserve 
energy leads to the Schroedinger equation and to a natural explanation of 
its linearity, its unitarity, and of the role of complex numbers. The phase 
of the wave function is explained as a feature of purely statistical origin. 
There is a quantum analogue to the gravitational equivalence principle. 



1 Introduction 

Our subject has been very succinctly stated by Jaynes: "Our present QM for- 
malism is a peculiar mixture describing in part realities in Nature, in part 
incomplete human information about Nature — all scrambled up by Heisenberg 
and Bohr into an omelette that nobody has seen how to unscramble." [T] He also 
understood where to start looking: "We suggest that the proper tool for incor- 
porating human information into science is simply probability theory — not the 
currently taught 'random variable' kind, but the original 'logical inference' kind 
of James Bernoulli and Laplace" which he explains "is often called Bayesian 
inference" and is "supplemented by the notion of information entropy" . Bohr, 
Heisenberg and other founders of quantum theory might have agreed. They 
were keenly aware of the epistemological and pragmatic elements in quantum 
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mechanics (see e.g., [2]) but they wrote at a time when the language and tools 
of quantitative epistemology had not yet been sufficiently developed. 

Our goal is to derive quantum theory as an example of entropic inference. A 
central feature is the privileged role we assign to position over and above all other 
observables. Position is, strictly, the only observable. This is one important 
difference from other approaches that also emphasize notions of information 
(see e.g., O-f^U])- The theory has formal similarities with Nelson's stochastic 
mechanics [2T]-[26], but there are important conceptual differences. Stochastic 
mechanics operates at the ontological level; its goal is a realistic interpretation 
of quantum theory as arising from a deeper, possibly non-local, but essentially 
classical "reality". In contrast, the entropic dynamics advocated here operates 
almost completely at the epistemological level. 

The statistical manifold associated to the configuration space is introduced 
in section 2. The basic dynamical question — the probability of a small step — 
is answered in section 3 using the method of maximum entropy |27) . (Earlier 
versions of related ideas were discussed in [2E]-[2I]) The introduction of time as 
a device to keep track of the accumulation of small changes is discussed in section 
4 and the Schroedinger equation is derived in section 5. An important new 
element is that the geometry of the statistical manifold is not a fixed background 
but a dynamical entity. This leads to a certain similarity with the theory of 
general relativity including a new quantum equivalence principle (section 6). 
Our conclusions are summarized in section 7. 

2 The statistical model 

We assume the usual configuration space X for a single particle, namely, a 
flat three-dimensional space. We further assume — and this is the crucial new 
element — that there is a very small but ultimately irreducible uncertainty 
in the location of the particle. The particle cannot be localized; at best we 
can specify its expected position x a — (y a ) and give a probability distribution 
p(y\x). Thus, when we say that the particle is at x what we mean is that the 
true but unknown position y is somewhere in the vicinity of x with probability 
p{y\x). 

To each point x € X there corresponds a probability distribution p(y\x) and 
the set of these distributions is also a three-dimensional manifold — a statistical 
manifold which we call A4. The same label x is used to denote points in X 
and the corresponding points in M. Points in M. are not structureless dots but 
probability distributions. 

The origin of the irreducible uncertainty is left unspecified. Indeed, the ar- 
guments below turn out to be remarkably independent of the particular choice 
of the functional form of p(y\x). Nevertheless, for the sake of clarity it is con- 
venient to choose a specific statistical model. We can reasonably assume that 
the distributions p(y\x) arise as the result of unknown microscopic influences, 
in which case, general arguments such as the central limit theorem lead us to 
expect that for a very wide variety of microscopic conditions the plausible dis- 
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tributions are Gaussians. We further assume that the Gaussians are spherically 
symmetric with a small non-uniform variance expressed as a small constant a 2 
modulated by a positive scalar field $(x), 



p(y\x) 



$ 3 / 2 (x) 

(27TO- 2 ) 3 / 2 



exp 



1 



where 



lab 



<&{x) lab {y a - x a ){y b - x b ) 



Sab 



(1) 



(2) 



is the metric in X. 

While the configuration space X is flat, the statistical manifold M. turns out, 
in general, to be curved. M inherits its unique geometry from the distributions 
p(y\x). The distance, d£ 2 = g a b dx a dx b , between x and x+dx, or better, between 
p(y\x) and p(y\x + dx), is given by the information metric [27 32] [33] , 



9ab 



dlogp(y\x) dlogp{y\x) 
dyp(y\x) ^ ^ 



dx a 



Substituting and ^ into ^ gives 



g a b(x) = -^Sab + 



2$ 2 



(3) 



(4) 



We will be interested in situations where the intrinsic uncertainties are very 
small. More precisely, when the change in $(x) over the support of p(y\x) is 
negligible, |<9$/<I>| 2 <C ^/c 2 , the metric simplifies considerably, 



9ab{x) 



$(x) 



Kb = ®(x)j a b , 



(5) 



and we see that plays the role of a conformal factor. 

For future reference, the entropy of p(y\x) relative to the flat measure of X 

is 

p{y\x) 3. $(z) 



S(x)=-J dyp(y\x) log 



^ 2 [ 1 - 1 ° g 2. 



(6) 



(The volume element in X is given by dv = 7 1 / 2 <i 3 iz; where 7 = det7 a b.) 

The generalization to N particles is straightforward. The 3iV-dimensional 
configuration space X^ remains flat but it is no longer isotropic. For example 
for N — 2 particles its metric is 



1AB 



S aibl /a 2 

5 a2 b 2 /(T 



(7) 



The position uncertainty is given by a Gaussian distribution p{y\x) in 3iV di- 
mensions, 

\Hx) lAB {y A - x A )(y B - x B )] , (8) 



p{y\x) oc exp 

where the index A takes the values 1 . 
metric 9ab{x) ~ $(x)jab- 



. 3N. The statistical manifold Ai n has 



3 



3 Law without law: entropic dynamics 



The basic dynamical information is that changes from one state to another are 
possible and do, in fact, happen. We do not explain why they happen but, 
given the information that changes occur, we want to venture a guess about 
what changes to expect. What gives this program some hope of success is 
the assumption that large changes result from the accumulation of many small 
changes. Therefore, our job separates into two parts, first we consider a small 
change, and then we hgure out how small changes add up. 

Consider a single particle that moves from an initial position x to an un- 
known final position x' . (The generalization to more particles is immediate.) 
What can we say about x' when all we know is that it is near x? Since x and 
x' represent probability distributions we use the method of maximum entropy 
(ME) [27 . As in all ME problems success hinges on appropriate choices of 
entropy, prior distribution, and constraints. Since neither the new x' nor the 
new microstate y' are known, the relevant universe of discourse is X x X and 
what we want to find is the joint distribution P{x' \y'\x) (34] . The appropriate 
(relative) entropy is 

S[P, n} = - [ dx'dy' P(x>, y'\x) log ■ ( 9 ) 

The relevant information is introduced through the prior Tr(x',y') and the con- 
straints that specify the family of acceptable posteriors P(x',y'\x). Consider 
the prior, ir(x',y') = ir(x')ir(y'\x'). Before the relation between the variables 
x' and y' is known the state of extreme ignorance is represented by a product, 
n(x' , y') = Tr(x')n(y') — knowledge of x' tells us nothing about y' and vice versa 
- and the probabilities n(y')d 3 y' and ir(x')d 3 x' are uniform, that is, propor- 
tional to the respective volume elements ~f 1 / 2 d 3 y' and ^ x / 2 d 3 x' . Proportionality 
constants are not essential here; we set 7t(t/') = 7 1 / 2 and n(x') = 7 1 / 2 and the 
prior is 

K{x',y') = 7 . (10) 

Next consider the possible posteriors P(x',y'\x) = P(x'\x)P(y'\x' , x). Be- 
sides normalization we impose two constraints. First we have the known relation 
between x' and y ': the particular functional form of P(y'\x' , x) is given by eq.Q. 
Therefore, 

P(x',y'\x) = P(x'\x)p(y'\x') . (11) 

The second constraint concerns the factor P{x'\x): we know that x' is only a 
short step away from x. Let x' a = x a + Ax a . We require that the expectation 

(At 2 ) = ( lab Ax a Ax b ) = \ 2 {x) (12) 

be some small but for now unspecified numerical value X 2 (x) which might per- 
haps depend on x. (Provided the steps are sufficiently short their actual length 
is not particularly critical; they do not even have to be all of the same length.) 
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Once prior and constraints have been specified the ME method takes over. 
Substituting §jxfy and ^ into ([9} gives 



S[P,tt] = -J dx' P(x'\x)\og ^JP + J dx' P{x'\x)S{x') , 



(13) 



where S(x) is given in cq.([6]). Varying P(x'\x) to maximize S[P,n] subject to 
( 12 ) and normalization gives 



P(x'\x) = 



C(x,a) 



D S(x')-±a(x)Ae 2 



(14) 



where a) is the normalization constant, and the Lagrange multiplier a(x) is 
determined from <91ogC/9a = — A 2 /2. We see that large values of a(x) clearly 
lead to short steps. 

P(x'\x) gives the probability of a step from x to x' . It is the basic building 
block out of which all dynamics is constructed. It implements what Wheeler 
foresaw as "law without law": "that every law of physics, pushed to the ex- 
treme, will be found statistical and approximate, not mathematically perfect 
and precise." [35] 

The most probable displacement Ax a is that which maximizes the scalar 
probability density. For large a{x) we expect Ax a to be small. Then 







d 



S(x') - l -a{x) lbc Ax b Ax c 



dx' a 

so that the maximum occurs at 

Ax a 



d a S(x) - a{x) lab Ax b (15) 



1 



a{x) 



l ab d b S{x) , 



(16) 



which shows that the particle tends to drift up the entropy gradient. Expanding 
ab 

1 



the exponent of ( 14 ) about its maximum gives 

a(x) 



P{x'\x) 



exp 



2a 2 



S ab {Ax a - Ax a ) (Ax b - Ax b ) 



(17) 



with a new normalization Z(x). The displacement Ax a can be written as its 
expectation plus a "fluctuation" , 



Ax a = Ax a + Aw a , 



where 



(Ax a ) = Ax a 



-S ab d b S(x) 



a(x) 

(Aw a ) = and (Aw a Aw b ) 



a(x) 



(18) 

(19) 
(20) 



We see that as a — > oo the steps get smaller, Ax a — > as a -1 , but the fluctua- 
tions become dominant because Aw a — > only as a" 1 / 2 . This implies that as 
a — > oo the trajectory is continuous, but not diffcrcntiable — just as in Brownian 
motion. 
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4 Time 



To keep track of the accumulation of small changes we need to introduce a 
notion of time. Our task here is to develop a model that includes (a) something 
one might identify as an "instant" , (b) a sense in which these instants might 
be "ordered" , and (c) a useful concept of "duration" measuring the interval 
between instants. This set of concepts constitutes what we will call "time" in 
entropic dynamics. A welcome bonus is that there is an intrinsic directionality 
from past to future instants; an arrow of time is generated automatically and 
need not be externally imposed. 

The foundation to any notion of time is dynamics. Given an initial position 
we have some idea, given by P(x'\x), of what the next position might be. For 
all steps after the first, however, we are uncertain about both the initial x and 
the final step x', which means we must deal with the joint probability P{x' , x). 
Using the product rule P(x',x) = P(x'\x)P(x) and integrating over x, we get 

P(x') = JdxP(x'\x)P(x) . (21) 

If P(x) happens to be the probability of different values of x at a given instant 
of time t, then it is tempting to interpret P{x ! ) as the probability of values of 
x' at a later instant of time t' = t + At. Accordingly, we write P(x) = p(x, t) 
and P(x') — p(x',t') so that 

p(x',t') = JdxP{x'\x)p(x,t) . (22) 

Nothing in the laws of probability forces this interpretation on us — it is an 
independent assumption about what constitutes time in the model. We use 



eq.{22) to define what we mean by an instant: if p(x,t) refers to an "initial" 



instant, then we use p(x',t / ) to define what we mean by the "next" instant. 



Thus, eq.(22| allows time to be constructed, step by step, as a succession of 
instants. 

Specifying the interval of time At between successive instants amounts to 
tuning the steps, or equivalently a(x), appropriately. To model a "Newtonian" 
time that flows "equably" everywhere, that is, at the same rate at all places and 
times we define At as being independent of x, and such that every At is as long 
as the previous one. Inspection of the actual dynamics as given in eq.( l7p0 l 
shows that this is achieved if we choose a{x) so that 

7- 

a(x) — — — constant . (23) 

where r is a constant introduced so that At has units of time. 

Thus, it is the equable flow of time that leads us to impose uniformity on 
the step sizes A 2 (a;) and the corresponding multipliers a(x). This completes 
the implementation of Newtonian time in entropic dynamics. In the end, how- 
ever, the only justification for any definition of duration is that it simplifies the 
description of motion, and indeed, eqs.( 18p0| are simplified to 



Ax a — b a (x)At + Aw a (24) 
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where the drift velocity b a (x) and the fluctuation Aw a are 



b a (x) 



S ab d b S(x) , 



(Aw a ) = and (Aw a Aw b ) = —AtS ab . 



(25) 
(26) 



Equation ( 25 ) gives the mean velocity to the future or future drift, 



(x a (t + At )_ m - x a (t) i 

b a (x)= lim — — = lim — fdx'P(x'\x)Ax a (27) 

v ' At^o+ At At^o+ At J y 1 J v ' 



where x = x(t), x' = x(t + At), and Ax a = x' a — x a . The expectation in (27) is 
conditional on the earlier position x = x{t). One can also define a mean velocity 
from the past or past drift, 



K(x)= lim 

At->0+ 



x a (t)-{x a (t-At)) x(t) 



At 



(28) 



where the expectation is conditional on the later position x — x(t). Shifting the 
time by At, 6" can be equivalently written as 



K{x') = lim 

At->0+ 



x a (t + At) - (x a (t))^ 
At 



(t+At) 



= lim -J- fdxP(x\x')Ax a 
At^o+ At J y 1 ; 



(29) 

with the same definition of Ax a as in eq.(27). 

The two mean velocities, to the future b a , and from the past 6°, need not 
coincide. The connection between them is well known |2"T1|23I. 



K(x,t) = b a (x) - —d a log P (x,t) , 



(30) 



where d a = S ab d b and p(x,t) = P(x). What might not be widely appreciated is 
that cq.(30) is a straightforward consequence of Bayes' theorem, 



(31) 



(For a related idea see [35].) The proof of eq.(30) is lengthy but straightforward; 
the crucial step is to Taylor expand P(x') about x in (31 ) to get 



P{x\x') = [1 - (d b log p) Ax b + .. .] P{x'\x) 



(32) 



which accounts for the d log p term in eq. ( 30 ) . 

The fact that b a ^ 6° is very significant because it signals an asymmetry 
in time. The arrow of time, constitutes a puzzle that has plagued physics ever 
since Boltzmann. The standard formulation of the problem is that the laws of 
nature are symmetric under time reversal but everything else in nature indicates 
a clear asymmetry. How can that be? 
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The notion that the laws of physics might be rules for the manipulation of 
information and not necessarily laws of nature offers a new perspective on this 
old puzzle. Note that entropic dynamics does not assume any underlying laws of 
nature — whether they be symmetric or not — and it makes no attempt to ex- 
plain the asymmetry between past and future. The asymmetry is the inevitable 
consequence of entropic inference. From the point of view of entropic dynamics 
the challenge does not consist in explaining the arrow of time; on the contrary, it 
is the reversibility of the laws of physics that demands an explanation. Indeed, 
time itself always remains intrinsically directional even when the derived laws 
of physics turn out to be fully reversible. 



5 The Schroedinger equation 



Time has been introduced as a useful device to keep track of the accumulation of 
small changes. The technique to do this is well known from diffusion theory. The 
equation of evolution for the distribution p(x,t), derived from eq.(22| together 

(33) 



with (|24|-([26j), is the Fokker-Planck equation, [23] [37 

dtp = ~d a (b a p) 



2r 



V 2 p 



where S ab d a d b = V 2 . Using eq.(30) it can be expressed in terms of 6° 



dtp = ~d a (Kp) 



2t 



V 2 p 



(34) 



Adding eqs.(33) and (34) gives a continuity equation 
dtp 



-d a (v a p) where v a = - (b a + J£) , (35) 

where v a is interpreted as the velocity of the probability flow or the current 
velocity. On the other hand, eq.(30) gives yet another "velocity" 

u a^l {b a_ b a ) = _^ d a logp (3g) 

I It 

Its interpretation follows from v a = b a +u a . The drift b a represents the tendency 
of the probability p to flow up the entropy gradient while u a represents its 
tendency to slide down the density gradient. The situation is analogous to 
Brownian motion where the drift velocity is the response to the gradient of 
an external potential, while u a is a response to the concentration gradient - 
the so-called osmotic force. Accordingly, u a is called the osmotic velocity. Its 
contribution to the probability flow is the actual diffusion current, 



pu 



2t 



d a P , 



(37) 



which can be recognized as Fick's law, with a diffusion coefficient given by 
ct 2 /2t. 
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Since both the drift b a and the osmotic velocity u a are gradients, it follows 
that v a — b a + u a is a gradient too, 



2 

v a = — d a (f> where </>(x, t) = S(x) - log p 1/2 (x, t) . (38) 
r 



Eqs.( 33|38 1 provide the complete answer to the problem as originally posed: 



what is the expected evolution of a particle afflicted by the intrinsic uncertainties 
described by p(y\x) in cq.Q? Unfortunately, we can see that the dynamics 
described by eqs.( 33p8 ) is not quantum mechanics; it is just diffusion. Indeed, 



in order to construct a wave function, ^> = /9 1 / 2 e^, in addition to the density p 
we need a second degree of freedom, a phase </>. 



Note that the function 4>{x,t) in eq.(38) does not (yet) qualify as an in- 
dependent degree of freedom because the entropy S(x) — or equivalently the 
conformal factor $(x) — is an externally prescribed field. As long as the statis- 
tical manifold is a fixed static manifold there is no logical room for additional 
degrees of freedom. To promote <j)(x,t) to an independent degree of freedom we 
are forced to allow the manifold A4 itself to participate in the dynamics. 

To specify the dynamics of the manifold we follow Nelson and assume that 
it is "conservative" [22.. Requiring that some "energy" be conserved may seem 
natural in that it clearly represents physically relevant information but we feel 
that it is an assumption that demands a deeper justification. Normally energy 
is whatever happens to be dynamically conserved as a result of invariance under 
translations in time. But our dynamics has not yet been fully defined; what, 
then, is "energy" and why should it be conserved in the first place? This is a 
question we leave for the future. At this early stage, for the purpose of deriving 
a non-relativistic model, we just propose an intuitively reasonable conserved 
energy and proceed. 

The energy is chosen to be a local functional that includes a term represent- 
ing a potential energy and includes terms in the velocities that are invariant 
under time reversal and under rotations. Under time reversal, t — > —t, we have 
b a — > —6" , v a — > —v a , u a -> u a . For low velocities this means we need only 
include quadratic terms in the velocities, v 2 and u 2 |26j . The proposed energy 
functional is 

E[p, v] = J d 3 x p(x, t) (A lab v a v b + B lab u a u b + V{x)) , (39) 

where A and B are constants. In order that E have units of energy A/a 2 and 
B/a 2 must have units of mass. Then 

E[p, v}= j d 3 x p(x, t) (^mv 2 + ^pu 2 + V(x)j , (40) 

where m = 2 A/ a 2 and p = 2B/a 2 will be referred to as the "mass" and the 
"osmotic mass" respectively. It is further convenient to combine the constants 
r and A into yet a new constant 77, which relates the units of mass or energy 
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with those of time, 



V 



2A , a 2 
— so that — 

T T 



v_ 

m 



Then the current and osmotic velocities, eqs.(38) and (36), are 
mv a = r)d a <f> and mu a = r]d a \ogp 1/2 , 



while (40) becomes 
E = 



dx p 



|V0) 2 + £j(d Q io g p) 2 + v 



(41) 
(42) 

(43) 



Next we impose that the energy E[p,<p] be conserved. After some manipu- 
lations involving integration by parts and the continuity equation, 



-d a (pv a ) 



d a (pd a <t>) 



the time derivative E of ( 43 1 is 
E = dx p r](j> + 



2m 



m 



V 



(d a pd a <p + pV 2 4>) 



2m 2 



2 V 2 p l/2 
oV2 



(44) 



(45) 



Requiring that E = for arbitrary choices of p [which follows from arbitrary 
choices of p and <fi in eq.(44)] we get 



v4>+^-(d a 4) 2 + v 

Zm 



m 2 V 2 p 1/2 
2m 2 oV2 







(46) 



Equations (44) and (46) are the coupled dynamical equations we seek. The 



evolution of p(x, t) in eq.( |44[ ) is determined by <j>(x, t); the evolution of 4>(x, t) in 
eq.(46), is determined by p[x,t). The evolving geometry of the manifold enters 
through <p(x, t). 

Next we show that, with one very interesting twist, the dynamical equations 
turn out to be equivalent to the Schroedinger equation. We can always com bine 
the functions p and <f> into a complex function W — p 1 / 2 exp(i<f>). Then eqs.(44) 
and ( 46 1 can be rewritten as 



2 2 

2m 2m 



1 - 



a/2 



mJ (W*)i/2 



4- . 



This reproduces the Schroedinger equation, 



n^— = — - 



-V 2 * + F* , 



(47) 



(48) 



dt 2m 

provided the osmotic mass is identified with the mass, fi = m, and rj is identified 
with Planck's constant, rj — h. Setting Sj = r\<\> in eq.(46) and letting 77 — s- 
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leads to the Hamilton- Jacobi equation and thus the classical limit n = % — » 
allows one to identify to with the inertial mass of the particle. 

But why should p = m? This question is so central that we devote the next 
section to it. But before that we note that the non-linearity in eq.(47) is undesir- 
able both for experimental and theoretical reasons. From the experimental side 
non-linear terms have been ruled out to an extreme degree through precision ex- 
periments on the Lamb shift [24] and even more so in hyperfine transitions |38j . 
From the theory side there is a consistency argument that links the linearity of 
the Hilbert space with the linearity of time evolution; retaining one and not the 
other leads to inconsistently assigned amplitudes [5]. And, further, it has been 
argued that the non-linear terms can lead to supcrluminal communication |39] . 
Therefore it is extremely probable that the identity of inertial and osmotic mass 
is exact. 

Among the many mysteries of quantum theory there is one — the central 
role played by complex numbers — that turns out to be related to these issues. 
The dynamical equations (44) and (46) contain no complex numbers but they 
can always be written in terms \I/ and instead of p and <p. There is no 
mystery there. The statement that complex numbers play a fundamental role 
in quantum theory is the non-trivial assertion that the equation of evolution 
contains only ^ and not both ^ and ^* . In the entropic approach both the 
linear time evolution and the special role of complex numbers are linked through 
the equality fi = to. 



6 A new equivalence principle 

The generalization to many particles is easy. The conserved energy (for N = 2) 
[see eq.([7|] is 



E = J d 3N xp(x,t) (Ajabv a v b + B lAB u A u B + V{x)) 



d e x p(x, t) ( ^mivf + ^m 2 vl + ^Hiuj + ^p 2 uj + V(x) 



(49) 



where we introduced inertial and osmotic masses, m n = 2A/a^ and /i„ = 
2B/a^. Note that the ratio of osmotic to inertial mass turns out to be a uni- 
versal constant, the same for all particles: fi n /m n = B/A. But why should 
fi n = to„ exactly? To see this let us go back to eq.(43). We can always change 
units and rescale r\ and r by some constant k into i] = kt]', t — t' j k. If we also 
rescale 



into 



'/k, eqs.(44) and (43) become 



= -5-(0°p0 a ^ + pvY) 



2 II 
[IK T] 

8m 2 



(c> Q logp) 2 + ^ 



(50) 
(51) 
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Then we can introduce a different wave function as = p 1 ^ 2 exp(z</>') which 
satisfies 

W # - -^W + + £ fl- ^ gg^f. (52) 
' 2m 2m V my (*'*'*)!/2 v y 

Since the mere rescaling by k can have no physical implications the different 
regraduated theories are all equivalent and it is only natural to use the sim- 
plest one: choose k = (A/B) 1 / 2 so that fin 2 — m and rescale the old p to 
a new osmotic mass p! = pn 2 — m. We conclude that whatever the value of 
the original coefficient p it is always possible to regraduate r\, <fi and p to an 
equivalent but more convenient description where the Schroedinger equation is 
linear and complex numbers attain a special significance. It is the rescaled value 
rj of the linear theory that gets numerically identified with Planck's constant 
h. From this perspective the linear superposition principle and the complex 
Hilbert spaces are important because they are convenient, but not because they 
are fundamental — a theme that was also explored in [9]. 

These considerations remind us of Einstein's original argument for the equiv- 
alence principle: We accept the complete physical equivalence of a gravitational 
field with the corresponding acceleration of the reference frame because this 
offers a natural explanation of the equality of inertial and gravitational masses 
and opens the door to an explanation of gravity in purely geometrical terms. 

Similarly, in the quantum case we accept the complete equivalence of quan- 
tum and statistical fluctuations because this offers a natural explanation of the 
Schroedinger equation — its linearity, its unitarity, the role of complex numbers, 
the equality of intertial and osmotic masses — and opens the door to explaining 
quantum theory as an example of entropic inference. 



7 Conclusions 

On epistemology vs. ontology: Quantum theory has been derived as an 
example of entropic dynamics. The discussion is explicitly epistemological — 
it is concerned with how we handle information and update probabilities. Of 
course, once we know x we can immediately make inferences about the unob- 
served "true" position y. But this is precisely the point: the relation between 
the "laws of physics" (the Schroedinger equation) and "actual reality" is less 
direct than it is commonly assumed. 

On interpretation: Ever since Born the magnitude of the wave function 
|^| 2 = p has received a statistical interpretation. Within the entropic dynamics 
approach the phase of the wave function is also recognized as a feature of purely 



statistical origin. We can make this explicit using eq.(38) to write the wave 
function as 



* = p— e lb (53) 

where p is a probability density and S is an entropy — it is the entropy associated 
to each point on the statistical manifold Ai. 
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On dynamical laws: The principles of entropic inference form the backbone 
of this approach to dynamics. Energy conservation was introduced as an im- 
portant constraint in the present non-relativistic theory. One can safely expect 
that in a fully relativistic theory it will not survive in its current form. The 
peculiar features of quantum mechanics such as non-locality and entanglement 
arise naturally by virtue of the theory being formulated in the 37V-dimensional 
configuration space. 

On time: Time was introduced to keep track of the accumulation of small 
changes and its particular form was chosen to simplify the description of evo- 
lution. We have proposed a scheme that models temporal order, its duration, 
and most interestingly, its directionality. 

Equivalence principle: The derivation of the Schroedinger equation from 
entropic inference led to a surprising similarity with general relativity. The 
statistical manifold M is not a fixed background but actively participates in 
the dynamics. The potential for uncovering deeper relations between quantum 
theory and gravitation theory looks promising. 
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